A data partitioning scheme for spatial regression
نویسندگان
چکیده
Precision agriculture data consisting of crop yield and topographic features are examined with the objective of explaining yield variability as a function of topographic attributes in order to extrapolate this knowledge to unseen agricultural sites. It is demonstrated that random data partitioning into training, validation and test subsets is not appropriate when dealing with agricultural problems characterized with strong spatial data correlation. A simple spatial data partitioning scheme that leads to significantly faster neural network training and slightly better generalization is proposed. Also, integration of predictors formed from spatially partitioned data led to improved generalization over a bagging integration procedure in experiments. The margin between the best spatial model and a trivial predictor for our precision agriculture problem was small indicating that topographic features alone could explain only a small amount of the yield variability.
منابع مشابه
Spatial Varying Coefficient Regression Model For Relative Risk Factors of Esophageal Cancer Patients
In conventional methods for spatial survival data modeling, it is often assumed that the coefficients of explanatory variables in different regions have a constant effect on survival time. Usually, the spatial correlation of data through a random effect is also included in the model. But in many practical issues, the factors affecting survival time do not have the same effects in different regi...
متن کاملSpatial Correlation Testing for Errors in Panel Data Regression Model
To investigate the spatial error correlation in panel regression models, various statistical hypothesizes and testings have been proposed. This paper, within introduction to spatial panel data regression model, existence of spatial error correlation and random effects is investigated by a joint Lagrange Multiplier test, which simultaneously tests their existence. For this purpose, joint Lagrang...
متن کاملPatchwork Kriging for Large-scale Gaussian Process Regression
This paper presents a new approach for Gaussian process (GP) regression for large datasets. The approach involves partitioning the regression input domain into multiple local regions with a different local GP model fitted in each region. Unlike existing local partitioned GP approaches, we introduce a technique for patching together the local GP models nearly seamlessly to ensure that the local ...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملThe R⊕-tree: Incorporating Object Partitioning into the R-tree
During the past three decades, researchers devoted much effort to developing efficient techniques to index spatial data. Of those proposed, the R-tree[1] is perhaps the most important. R-trees are ubiquitous in commercial database management systems, and they find myriad applications in other disciplines as well. In this paper, we seek to improve the state-of-the-art R-tree[2] by introducing sp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999